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We develop a method of constructing percolation clusters that allows us to build very large 
clusters using very little computer memory by limiting the maximum number of sites for which we 
maintain state information to a number of the order of the number of sites in the largest chemical 
shell of the cluster being created. The memory required to grow a cluster of mass s is of the order 
of s e bytes where 6 ranges from 0.4 for 2-dimensional lattices to 0.5 for 6- (or higher)-dimensional 
lattices. We use this method to estimate d m i n , the exponent relating the minimum path £ to the 
Euclidean distance r, for 4D and 5D hypercubic lattices. Analyzing both site and bond percolation, 
we find d min = 1.607 ± 0.005 (4D) and d min = 1.812 ± 0.006 (5D). In order to determine d mul to 
high precision, and without bias, it was necessary to first find precise values for the percolation 
threshold, p c : p c = 0.196889 ± 0.000003 (4D) and p c = 0.14081 ± 0.00001 (5D) for site and p c = 
0.160130±0. 000003 (4D) andp c = 0.118174±0.000004 (5D) for bond percolation. We also calculate 
the Fisher exponent, r, determined in the course of calculating the values of p c : r = 2.313 ± 0.003 
(4D) and r = 2.412 ± 0.004 (5D). 



I. INTRODUCTION 

Percolation is a standard model for disordered systems 
In percolation systems, sites or bonds on a lattice 
are populated with probability p. The value of p at which 
infinite clusters are formed is known as the critical prob- 
ability or percolation threshold p c . The shortest path 
exponent, d m ; n , is defined by the relation 

(^r^, (1) 

where r is the Euclidean distance between two sites on a 
cluster and £ is the length of the shortest path traveling 
along occupied sites and bonds in the percolation cluster 
( "chemical distance" ) . We can also write 

(r)~^, (2) 

which defines the exponent z — l/d m i n . With the excep- 
tions of d > 6 (where z — 1/2) and d = 1 (where z = 1), 
z is not known exactly. The most common method of 
determining z numerically (and the one we will use) is 
to grow clusters, calculating the average distance (r) of 
sites in the cluster from the seed of the cluster as a func- 
tion of chemical distance, £, from the seed. In order that 
finite size effects do not play a role, the lattice must be 
large enough such that the clusters which are grown do 
not reach the boundaries of the lattice. 

Because corrections-to-scaling decrease with increasing 
£, the larger the value of £ max (the value of £ at which 



we stop the growth), the more accurately we can esti- 
mate z. The limitations on the size, € max , to which the 
clusters can be grown have been the computer memory 
available for the simulation and the computer process- 
ing power needed to build these clusters. The method 
of "data blocking" |||| has helped ameliorate the need 
for large amounts of memory. In this method, the lattice 
is logically divided into blocks; memory for a block is 
not allocated until the lattice grows into that block. The 
data blocking method has been used recently to obtain 
precise estimates for the percolation threshold and associ- 
ated exponents for bond and site percolation on a number 
of lattices ||f7| . Ultimately, however, although sufficient 
computer power is available to build larger clusters, the 
cluster size is limited by the amount of memory available. 
This becomes particularly true as the dimension of the 
lattice d increases since at criticality the cluster becomes 
less dense as d increases ||. To reach the same cluster 
mass or £ max , we must have larger lattices. 

In this paper we describe a method of constructing 
clusters which dramatically reduces the memory require- 
ments needed to grow large clusters relative to previous 
methods. Using this method of building large clusters, 
we estimate z for hypercubic lattices in 4, and 5, di- 
mensions. The study of critical properties in higher di- 
mensions is important because one can use the results to 
test relations which are conjectured to hold in all dimen- 
sions (hyperscaling relations) and exponents which are 
believed to be the same in all dimensions (superuniversal 
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exponents) . The current best estimates of d m i n for 4 and 
5 dimensions, 1.5 and 1.8, respectively 0], are of rela- 
tively low precision compared to the estimates available 
in 2 and 3 dimensions 1.1307 ± 0.0004 and 1.374 ± 0.006, 
respectively B||. 

II. CLUSTER GENERATION 

One method of cluster generation is the Leath method 
H . In this method a site is chosen as the seed site of the 
cluster. Using a random number generator and a given 
bond occupation probability, one determines whether the 
bonds connected to the seed site are occupied or not jlo| . 
If a bond is occupied, the site to which this bond con- 
nects is considered to be part of the cluster and becomes 
a "growth site." These sites are at chemical distance of 
unity from the seed site; all sites at the same chemical 
distance, £, from the seed site are considered to be in 
"chemical shell I." The process is then repeated for each 
of these growth sites with the next set of growth sites be- 
ing at chemical distance 2 from the seed site. The cluster 
continues to grow until the growth stops naturally, the 
growth is terminated by the sides of the d-dimensional 
lattice of edge L, or the maximum chemical distance, 
f max , is reached. 

We use the Leath method to construct clusters, but we 
keep track of which bonds are occupied and which sites 
have been visited by a method different from that tra- 
ditionally used. Traditionally, this state information is 
stored in an array of size equal to the number of lattice 
sites. In the data blocking method, memory usage can be 
improved by allocating blocks in this array dynamically. 
Vollmayr jll| eliminated the use of this array, storing 
status of visited sites in a data structure thus reducing 
memory requirements to grow a cluster of mass s to 0{s). 
We extend the approach of Ref. (ll]] further, reducing the 
memory required to 0(s ) where 9 ranges from 0.4 for 2- 
dimensional lattices to 0.5 for 6- (or highcr)-dimensional 
lattices. 

To see how this can be done, we first consider the uses 
of this state information: 

(a) Occupancy status: Information concerning whether 
a site/bond is occupied is maintained so that it 
is the same, independent of when in the growth 
process it is accessed. For example, we would not 
generate a cluster with the proper statistics if we 
treated a bond as occupied during one stage of the 
cluster growth and then treated it as empty during 
a later stage. 

(b) Visited status: Information concerning whether a 
site has been visited or not is maintained in order 
that (a) we do not multiply count the presence of 
a site in the cluster and (b) we do not retrace our 
steps during cluster generation, causing the growth 
process to never end. 



A. Occupancy Status 

We address the need to maintain information about 
whether a bond is occupied or not by using a random 
number generation scheme in which the random number 
associated with a bond is determined by the location of 
the bond in the lattice and the orientation of the bond. 
This is done by first assigning a unique number n to 
any site in the lattice as follows: Let (x%, X2, x 3 . . . Xd) 
be the coordinates of the site in the lattice, and let 
(Li, L2, L3 . . . Ld) be the lengths of the sides of the lat- 
tice. Then 

n(x 1 ,x 2 ,x 3 ...x d ) = 

[({[(X1L2) + x 2 ]L 3 } + x 3 .. .)L d + x d ] (3) 

assigns a unique number to any site in the lattice. We 
assign a unique number, n' , to any bond in the lattice by 
defining 

n'{xi,x 2 ,x 3 ...x d ,o) = [n(xi,x 2 ,x 3 . ..x d )d] + o, (4) 

where o is the orientation of a bond attached to site x 
(assuming values to d — 1). 

Furthermore we want to assign unique numbers to 
bonds over many different realizations. We then define 

n"(xi, X2, x 3 . . .x dl o,m) = 

[n'(xi,X2,x 3 ...Xd,o)M]+m, (5) 

where m is the number of the realization and M is the 
maximum number of realizations we plan to create. 

We then generate a 64-bit random number, R, using 
an encryption- like algorithm f{n") []l2| using n" as its 
input, 

R = f(n"). (6) 

A bond is occupied if R > 2 6A p. In practice, because 
for large lattices and a large number of realizations n" 
is greater than 2 64 , the maximum size of the input to 
the random number algorithm, we actually determine the 
random number in two steps, 

R = f({[f(n)d]+o}M + m). (7) 

That is, we first create an intermediate random num- 
ber based only on the coordinates of the bond and then 
create the final random number based on the interme- 
diate random number, the orientation of the bond and 
the realization number. Using the test described in [ jl3| , 
we confirm that, within statistical error, our algorithm 
generates unbiased random numbers. This test is im- 
portant because there is only a small difference between 
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the inputs to the random number generator for neighbor- 
ing sites. Any correlations between the outputs would 
cause incorrect results [B. The generation of random 
numbers using Eq. Wj is slower than congruence or shift 
register techniques |13 ] but is somewhat compensated by 
eliminating the processing done to store and access bond 
state when maintained in an array. In any case, the net 
effect of using this approach is about a factor of 5 in- 
crease in calculation time because of the slowness of the 
encryption-like random number generator that we used. 



B. Visited Status 

We address the need to maintain information about 
whether a site has been visited or not by storing infor- 
mation about visited sites in a data structure. Each entry 
in the data structure contains the coordinates of the site, 
the chemical shell of the site, and a bit map with one bit 
for each direction from which the site can be visited. The 
data structure can be accessed as a "circular list" (last- 
in-first-out queue) so entries can be added and deleted. 
Since a site can be visited from different directions, we 
must ensure that a site is counted only once and that 
backtracking does not occur. To accomplish this, before 
adding a site to the list of growth sites, we first check to 
see if it is already in the list. 

• If it is already on the list, we do not add a new en- 
try but, in the entry for the site already in the list, 
we do set the bit corresponding to the orientation 
of the connected bond which was traversed to visit 
the site. 

• If it is not in the list, we add it (storing the co- 
ordinates and chemical length and setting the bit 
corresponding to the direction from which the site 
was visited). 

When we are about to process the entry for a growth 
site, we only count the site once in the mass of the clus- 
ter, and only attempt to grow the cluster in directions 
other than those from which the site was visited. In this 
way we avoid backtracking along already traveled paths. 
If the data structure had to be searched sequentially ev- 
ery time we were about to create a growth site, the time 
needed would make this approach impractical. In [pj, 
the data structure was maintained as a binary tree in or- 
der to reduce search time. We use the faster "hash table" 
method |Q to access entries for the visited sites. 

The hashing technique works as follows: A key, K, is 
associated with each entry of the data structure. We use 
a function h(K) to map the key into a "slot" at offset 
h(K) in a "hash table" . If the slot in the table is not 
already used, we store the number or address of the en- 
try in this slot; if the slot is used (this is referred to as a 
"collision" ) we add the entry to a chain of entries all of 
which map to the same value h(K). Ideally the function 



h(K) maps the keys uniformly over the slots in the table 
so we obtain few long chains. If we use a hash table of 
size M = 2 m , where m is an integer and choose K as 
the unique number, n, of the site, an effective hashing 
function is 

h (n) = [(« C) mod 2 W ] , (8) 

where w is the word size(in bits) of our computer and 
the hash constant, C is the least significant w bits of the 
product of 2 W and the "golden ratio", {Vb - l)/2. Thus 
h(n) yields the upper m bits (shift right w — m bits) of 
the result of taking the lower w bits of the product of 
the unique site number and the hash constant, C . We 
implement the ability to chain entries in the data struc- 
ture by defining another field in the data structure entry 
which serves as a chain pointer field. To find an entry in 
the data structure for a site, we calculate the unique site 
number using Eq. (0), find the offset in the hash table us- 
ing Eq. (JsJ) , and then walk the chain of entries to find the 
entry with the desired coordinates. If we make the size of 
the hash table equal to the size of the site data structure, 
we find the average number of hash "collisions" to be less 
than 2 so we can determine if a site has been visited very 
efficiently. 

This approach of keeping the status of visited sites in 
a special data structure (not in the lattice array) applies 
to any lattice model. In the case of growing percolation 
clusters we can further reduce the amount of memory 
needed significantly. This is accomplished by recognizing 
that a site which is multiply visited is done so during the 
growth of a single chemical shell. This is the key insight 
that allows us to reduce the memory requirement and can 
be confirmed by considering the bonds adjacent to a site 
in a lower chemical shell: (i) an occupied bond adjacent 
to a site in a lower chemical shell cannot be a path to 
re-visit that site because we do not back-track and (ii) 
an unoccupied bond adjacent to a site in a lower chemi- 
cal shell cannot be on the path to revisit that site. Sites 
in the same chemical shell can, however, be visited by 
multiple paths as shown in Fig. [j](a). Thus we need only 
keep state information about growth sites which them- 
selves have not yet been used to create entries for the 
next chemical shell. The number of such sites at any 
point in the growth process will be of the order of the 
size of the current chemical shell. 

The discussion so far has been for hypercubic lattices. 
For these lattices, we ensure that we did not double- 
count site or backtrack by maintaining information about 
growth sites which themselves have not yet been used 
to create entries for the next chemical shell and then 
checking for duplicates. More generally (e.g., for trian- 
gular lattices), the situation is a little more complicated 
as shown in the example in Fig. 0(b). A similar situa- 
tion is shown in Fig. |l|(c), where we grow a cluster from 
multiple seeds. To treat both types of situation, we must 
maintain (i) state information about growth sites which 
themselves have not yet been used to create entries for 
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the next chemical shell and (ii) state information about 
all sites in the chemical shell previous to the one being 
built. Before we add a site to the list of growth sites, we 
check if it is already present in the previous shell; if it is, 
we do not add it. 

The size of a chemical shell can be estimated as follows. 
The chemical distance, £, scales with the mean Euclidean 
radius of the cluster, r, as 



(9a) 



while the cluster mass (the number of sites in the cluster), 
s, scales as 

s~ r d f, (9b) 

where <i m in has values 1.13 and 2 for d = 2 and 6, respec- 
tively P,p|, ^5|p6| ; df, the fractal dimension of the cluster 
mass, has the exact values 91/48 = 1.89 and 4 for d = 2 
and 6, respectively WW. Then 



and 



ds ~ ^//^min)- 1 ^ — ( s d min/ d /)( d // d min)- 1 ^ ; 



s l-(«Wn/ d /)<#. (11) 

Setting dt = 1 , we find the size of the outermost chemical 
shell of a cluster of mass s scales as 



Sshell(s) 



(12) 



where 8 — 1 — d m { n /df. 

The values of 9 range from w 0.4 to 0.5 for d — 2 to 
d — 6. Thus the size of the data structure to contain the 
visited status is only of the order of the square root of 
the size of the cluster size because we only store status 
for the largest chemical shell. 



one proposed in this work is that the state of all sites are 
recorded (as described in appendix A), so it allows us- 
ing a fast random number generator. The data blocking 
method method allows lattices of sufficient size to keep 
finite-size effects under control, with sufficient speed to 
achieve good statistics. (The hashing method described 
in this paper could also have been used for this calcula- 
tion.) 

In 4D, we use a virtual lattice of (512) 4 sites, broken up 
into blocks of (16) 4 sites each. In 5D, the virtual lattice of 
size (128) 5 is divided into blocks of size 8 5 . The cluster- 
size cutoff, s max , is 2 17 = 131,072 and 2 14 = 16,384 for 
4D and 5D, respectively. The threshold is determined as 
the value of p that leads to the cluster size distribution n s 
best following a power-law n s ~ s~ T . Simulating about 
10 8 clusters for each case, and using the data analysis 
techniques employed in H, we find 



Pc = 



0.196889 ±0.000003 
0.160130 ±0.000003 
0.14081 ±0.00001 
0.118174 ±0.000004 



[4D site] 
[4D bond] 
[5D site] 
[5D bond] 



Also, for r we find the values 



2.313 ± 0.003 [4D] 
2.412 ±0.0004 [5D] 



(13) 



(14) 



These results are more precise than some of the published 
values for p c = 0.16005 ± 0.00015 @, 0.1407 ± 0.0003 
§Jj, and 0.11819 ± 0.00004 @ for 4D bond, 5D site, 
and 5D bond percolation, respectively and for r = 2.41 
for 5D percolation; for 4D site percolation, Ballesteros 
et al. p9| ] found the comparably precise value p c — 
0.196901 ± 0.000005 (just slightly higher than ours) and 
t = 2.3127 ± 0.0007. All simulation parameters and our 
results are summarized in Table I. 

The precision of our results is sufficiently high that we 
expect that statistical errors in p c will not have an effect 
on our value of d m i n . 



III. PERCOLATION THRESHOLD AND FISHER 
EXPONENT 

In order to determine d m i n to high precision and with- 
out bias, it is necessary to first find values of the percola- 
tion threshold substantially more precise than previously 
known (in most cases). To determine these thresholds 
we used the method of measuring cluster-size statistics 
of individual clusters grown on large virtual lattices as 
described in ||. The data-blocking method || used in- 
volves assigning memory to parts of the lattice only when 
the cluster grows into it. With the data-blocking method, 
like the hashing method, a table is used to access a data 
structure but in this case, the data structure entries rep- 
resent blocks of sites instead of individual sites; there are 
no collisions, but some memory is wasted. The advan- 
tage of using the data-blocking method as opposed to the 



IV. SHORTEST PATH EXPONENT 

To calculate the shortest path exponent, we ran sim- 
ulations at the percolation thresholds found above. We 
stopped cluster growth at ^ max = 2048 for 4D bond and 
site percolation and £ maK = 1024 for 5D bond and site 
percolation. We simulated 73 x 10 6 , 39 x 10 6 , 105 x 10 6 , 
and 20 x 10 6 realizations for 4D bond, 4D site, 5D bond, 
and 5D site percolation, respectively. During our sim- 
ulations, we kept track of the maximum and minimum 
lattice points to which our clusters extended. Using this 
information, we determined the size of the lattice that we 
would have needed to build if we had been using conven- 
tional memory techniques. For d = 5 the lattice would 
have had sides of length L = 245 resulting in approxi- 
mately 900 x 10 9 lattice sites (w 1TB memory); the ac- 
tual memory used was less than as 10 6 (1MB), six orders 
of magnitude smaller. 
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Figure ||a shows plots of (r) for 4D site and bond per- 
colation, while Fig. ||b shows plots of (r) for 5D site 
and bond percolation. While the plots resemble straight 
lines, the effects of corrections-to-scaling are, in fact, con- 
siderable. One customarily assumes_that corrections-to- 
scaling have the functional form [111— 



(15) 



two point at £ and 2£ is the value of the slope between 
these points in a log- log plot of (r(£)} 



(r) ~f(l + Ar A + ---) ; 



where the constant A depends on the dimension, lattice 
type and percolation type (bond or site) but the expo- 
nent A depends only on dimension. Let 



h(£) 



(r) 



(l + Ar A + 



(16) 



where z' is an estimated value of z. If £ max were infinitely 
large, we could determine z as the value of z' , which re- 
sults in a plot of h{£) which asymptotically approaches a 
constant (i.e., has zero slope as I — > oo); however, since 
f M is finite, we may obtain misleading results if we de- 
termine z in this manner. Nevertheless, we can use this 
approach to determine bounds on z. 

To see how this is accomplished, first consider Fig. ^a, 
in which we plot h(£) for 4D bond percolation for vari- 
ous values of z'. From this figure and Eq. ( |l6| ) it is clear 
that A is positive. Hence, we know that if for large £ the 
slope of h(£) becomes an increasing function, the lead- 
ing power-law term £ z ~ z will dominate because z > z' . 
Thus a lower bound on z is that value of z' at which h(£) 
asymptotically becomes an increasing function. From 
Fig. |a this value is 0.620. 

We can proceed similarly by considering site percola- 
tion in 4D, plotting h(£) for 4D site percolation for var- 
ious values of z' in Fig. |^b. From these plots it is clear 
that A for bond percolation is negative. Hence we know 
that if for large £ the slope of h(£) becomes a decreasing 
function, we are seeing the leading power-law term £ z ~ z 
dominate because z < z'. Thus an upper bound on z is 
that value of z' at which h(£) asymptotically becomes a 
decreasing function. From Fig. ||b this value is 0.625. 

Proceeding in the same manner for site and bond per- 
colation in 5D (see Fig. ||a,b), we find that the constant 
A is positive for both bond and site percolation, allowing 
us to determine only an upper bound of z — 0.5515 (the 
lower of the upper bounds for site and bond percolation) . 
While this method of finding bounds on z by identify- 
ing the value of z 1 at which the slope of h(£) changes 
sign does not always yield both upper and lower bounds, 
it has the advantage that it does not require any esti- 
mation of the parameters A and A in Eq. (12) and, in 
fact, is somewhat insensitive to the exact form of the the 
corrections-to-scaling terms. 

We also analyze our data using another more 
commonly- used method j§n§]- That method is to plot 
the effective exponents, z{£), between points £ and 2£ ver- 
sus £~ A using an estimated value of A which yields the 
straightcst line. The effective exponent, z(£), between 



z(£) = 



ln[(r(2£))] - H(r(£))} ln[(r(2£))/(r(£))} 



ln[2£] - \n[C] 



ln[2] 



(17) 



The £ = intercept of a plot of z(£) will be an estimate 
for z, and the slope will be proportional to A. Our best 
estimate for A for d = 4 and d = 5 is 0.4 < A < 0.6, so we 
use a value of A of 0.5 and plot z(£) for 4D site and bond 
percolation in Fig.Jqa and 5D site and bond percolation 
in Fig. ||b. In Fig. ||a, the fact that the slopes of the lines 
change suggests that we are seeing the effects of both the 
correction-to-scaling term in Eq. (|15| ) as well as higher or- 
der terms which become less significant at larger values 
of £. In general, it is more efficient to generate smaller 
clusters and more of them, rather than fewer, larger ones. 
However, if the corrections-to-scaling are not well under- 
stood or large, then one must build the largest clusters 
possible. As we see here, strong corrections-to-scaling 
are present in 4D percolation where the plots of effective 
slope change at large £. If we had used smaller clus- 
ters using traditional memory-management techniques 
we would have obtained incorrect results. In Fig. |^a, 
the almost horizontal plot for site percolation indicates 
that the amplitude, A, of the correction-to-scaling term 
is very small. From these plots and our estimates above 
of bounds on z, we estimate 



0.622 ± 0.002 [4D] 
0.552 ± 0.002 [5D] 



In terms of c? m i n , this corresponds to 



1.607 ± 0.005 [4D] 
1.812 ±0.006 [5D] 



(18) 



(19) 



The previously published values for d m i„ are 1.5 and 1.8 
for 4D and 5D |1|] . Thus our estimates of d m i n are of con- 
siderably higher accuracy than the existing ones and have 
accuracy comparable to that for the estimate of d m in hi 
3 dimensions, 1.374 ± 0.006 ||. Our results and all sim- 
ulation parameters are summarized in Table II. 



V. DISCUSSION 

We have developed a technique which allows us to build 
very large percolation clusters using very little mem- 
ory. In fact, using the method described here, relative 
to computer processing power available today and in the 
foreseeable future, computer memory is no longer a con- 
straint on building percolation clusters near the perco- 
lation threshold. The critical computer resource thus 
becomes solely processing power. For example, by ex- 
trapolating from our simulations, we find that with our 
method, with less than 10 8 bytes of memory, we could 
build a 5D cluster of 10 12 sites, which would have re- 
quired a lattice of 10 17 sites, and reach a value of £ max of 
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10 7 (versus the 1024 cutoff we used in our simulations). 
But the time to build a single trillion-site cluster would 
be about 2000 hours on current workstations. As proces- 
sor speeds increase, our technique for reducing memory 
usage should allow critical exponents and constants to be 
determined with greater precision. Current techniques of 
growing clusters, including the one described in this pa- 
per, require computer processing resource of O(s), where 
s is the size of the cluster grown. 

We note that the technique we have developed is use- 
ful when we can count the quantities in which we are 
interested as we build the cluster (e.g., cluster mass, av- 
erage distance to sites in a chemical shell) . On the other 
hand, it is not clear how we could calculate the mass 
of the backbone, for example, using our method because 
current methods of determining backbone mass require 
knowing all the sites in the cluster, not just those in the 
current chemical shell. To obtain backbone properties 
one could, however, reduce memory required to ~ s (ver- 
sus L d ) by maintaining information about all visited sites 
(not just those in the current chemical shell) in a data 
structure as opposed to maintaining the full lattice data 
structure JO ]. In Appendix A, we describe an alterna- 
tive method of cluster generation which can be used when 
information about all visited sites must be maintained. 

Finally, it is useful to compare our method with 
the Hoshen-Kopelman method pOj ] , which constructs all 
clusters in a d-dimensional lattice by successively popu- 
lating d — 1 dimensional slices of the lattice. Memory is 
used to store the last and current slice of the lattice so the 
memory needed scales as L d_1 . The Hoshen-Kopelman 
method is much less memory efficient than the method 
presented here, and becomes less effective as the dimen- 
sion increases since L d_1 /L d — > 1 with increasing d. Also, 
the Hoshen-Kopelman method cannot be used to calcu- 
late d m { n . On the other hand, the Hoshen-Kopelman 
method is better suited to other problems, such as cal- 
culating the number of clusters that span across a rect- 
angular system, than our method, based on the Leath 
algorithm. 

Appendix A: Alternative Method of Cluster Growth 

We discuss a variant of our approach in which we still 
store information concerning which sites are visited in 
a data structure and access the entries using a hash ta- 
ble. However, if one stores information about all visited 
sites, not just for those in the last shell(s), then a tra- 
ditional random number generator (one which does not 
take the coordinates/orientation of the bond as input) 
can be used. Let us first consider the case where we have 
no need for occupied bond information (e.g., we are sim- 
ply counting the number of sites in the cluster). When 
considering a growth site, if: 

(i) an adjacent site is vacant we determine whether the 
bond connected to that site is occupied or not. 



(ii) an adjacent site is not vacant we simply do not 
make a determination of whether the bond is occu- 
pied. 

In this way we make a determination about whether a 
given bond is occupied no more than once. 

Now consider the case in which we do have a need to 
know whether a bond is occupied or not (e.g., we are 
counting the number of bonds in the cluster or we will 
be determining the backbone of the cluster). In this case, 
when considering a growth site, if: 

(i) an adjacent site is vacant, we determine whether 
the bond connected to that site is occupied or not. 

(ii) an adjacent site is not vacant and is in a higher 
chemical shell, we also determine whether the bond 
is connected to that site is occupied or not. 

(iii) an adjacent site is not vacant and is in the same 
chemical shell as the growth site, we make a deter- 
mination about whether the bond is occupied only 
if the direction from the growth site to the adja- 
cent site is positive. In this way, the determination 
about whether the bond is occupied is done only 
once. This situation arises in non-cubic (e.g., tri- 
angular) lattices and when we start cluster growth 
with multiple seeds. 

(iv) an adjacent site is not vacant and is in a lower 
chemical shell than the growth site, we make no 
determination about whether the bond to that site 
is occupied; whether the bond is occupied has been 
determined earlier in the growth process. In fact, 
the bond must be unoccupied because if it were 
occupied we would have reached the growth site 
earlier directly from the adjacent site. 

Thus we ensure that we determine whether a bond is oc- 
cupied once and only once. If one needs to keep a record 
of whether a given bond is occupied (e.g., to later deter- 
mine the backbone) this information can be stored in the 
entry in the data structure for the site with which the 
bonds are associated along with the coordinates of the 
site, etc. 

This method trades off memory (we keep state for 
all visited sites) versus performance (we can use the 
faster traditional random number generators as opposed 
to the encryption-like random number generator). Also, 
in cases where we, for some other reason, must keep state 
information about all the sites, we can obtain the benefit 
of the using a faster random number generator. 
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TABLE I. Simulation parameters and results for p c and the Fisher exponent t. 



dimension 


type 


# of realizations 


^ max 


Pc 


r 




bond 


10 s 


131 073 


0.160130 ± 0.000003 




4 










2.313 ± 0.003 




site 


10 s 


131 073 


0.196889 ± 0.000003 






bond 


10 s 


16 383 


0.118174 ±0.000004 




5 










2.412 ± 0.004 




site 


10 8 


16 383 


0.14081 ± 0.00001 





TABLE II. Simulation parameters and results for the spreading exponent z and shortest path exponent d, 



dimension 


type 


Pc 


# of realizations 


£max 


2 






bond 


0.160130 


73 x 10° 


2048 






4 










0.622 ±0.002 


1.607 ± 0.005 




site 


0.196889 


39 x 10 6 


2048 








bond 


0.118174 


105 x 10 b 


1024 






5 










0.552 ± 0.002 


1.812 ±0.006 




site 


0.14081 


20 x 10 6 


1024 
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FIG. 1. Examples of cluster growth at the beginning of the population of sites at chemical distance 3 from the seed site. 
The seed sites are denoted by striped circles, (a) Example of a square lattice in which a site, C, is multiply-visited from sites 
A and B. (b) Example of a triangular lattice in which site C can be multiply-visited from sites A and D and in which site 
D can be multiply-visited from sites B and C. (c) Example in which the cluster is grown from multiple seeds. Site C can be 
multiply- visited from sites A and D; site D can be multiply- visited from sites B and C. 
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FIG. 2. Euclidean distance (r) versus chemical distance £ for site percolation (upper line) and bond percolation (lower line) 
for (a) 4D and (b) 5D. The slightly different apparent slopes of the plots for bond and site cases are due to different values of 
the correction-to-scaling parameters. 
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FIG. 3. h(l) = (r) jl z versus £ for (a) 4D bond percolation for values of (from top to bottom) z = 0.615, 0.620 and 0.625 (b) 
4D site percolation for values of (from top to bottom) z' = 0.623, 0.625, and 0.627. The dashed horizontal lines are provided 
as guides to the eye to allow one to better see that, for large £, the middle plots of h(£) in (a) and (b) are increasing and 
decreasing, respectively. 
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FIG. 4. h(£) = (r) ft versus i for (a) 5D site percolation for values of (from top to bottom) z' = 0.5510, 0.5515 and 0.5520, 
and (b) 5D bond percolation for values of (from top to bottom) z' = 0.5595, 0.5615, and 0.5635. The dashed horizontal lines 
are provided as guides to the eye to allow one to better see that, for large £, the the middle plots of h(£) in (a) and (b) are 
decreasing. 
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